RDF-based Data Sharing of Bio-resource Related Information
نویسندگان
چکیده
“Bio-resources”, commonly used biological materials for experimental studies such as mouse strains, cell lines and microbe culture collections are crucial fundamentals to provide reproducibility and reliability of data in life science. To provide advanced infrastructure of life science, wider-dissemination, quality control and standardization of bio-resources are required. In this sense, data of bio-resources and related information also should be broadly “shared” in life science community. Standardized methodology of data handling across databases and software applications, which helps to maximize utility and re-use of released data is also important issues. Resource Description Framework (RDF) and Semantic Web technologies provide suitable infrastructure for wide dissemination and re-use of bio-resource related information. Therefore, we have worked out construction of RDF data of bioresources (mouse strains, rat strains, medaka strains, cell lines and microbe strains) collected from multiple resource centers in Japan. We adopted community-developed data schemas for cell lines (Cell Line Ontology: CLO) and microbes (Microbial Culture Collection Vocabulary (MCCV). For descriptions of phenotypic properties of bio-resources, we designed common schema links to ontologies of phenotype (e.g. Mammalian Phenotype Ontology and Zebrafish Phenotype Ontology), body parts (e.g. Adult Mouse Anatomy, and Zebrafish Anatomy) and Phenotypic Quality (PATO). Constructed RDF data are available from RIKEN Meta Database (http://metadb.riken.jp), which provides web-based interfaces of relational-database like data viewer with table and card interfaces, bulk data download function and SPARQL endpoint. Each database projects in RIKEN Meta Database is accessible from the portal site J-phenome (http://jphenome.info). RDF-version datasets of bio-resources help coordination across multiple databases. Common data schema of bio-resource related datasets easily enables cross-dataset search of resources showing related phenotypes classified as a specific category. In addition, we are planning to collaborate with MicrobeDB.jp, which is an integrated database of microbial metagenomes, for sharing the latest RDF data of microbial strains in RIKEN BioResource Center. We expect that RDF-based data coordination will contribute to global sharing and improvement of utilities of bio-resources. Screenshots of the metadata of BRC mouse resources and phenotypes (http://metadb.riken.jp/metadb/db/rikenbrc_mouse)
منابع مشابه
GlycoRDF: an ontology to standardize glycomics data in RDF
MOTIVATION Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS An international group...
متن کاملSPARQL2OWL: Towards Bridging the Semantic Gap Between RDF and OWL
Several large databases in biology are now making their information available through the Resource Description Framework (RDF). RDF can be used for large datasets and provides a graph-based semantics. The Web Ontology Language (OWL), another Semantic Web standard, provides a more formal, modeltheoretic semantics. While some approaches combine RDF and OWL, for example for querying, knowledge in ...
متن کاملSharing, Discovering and Browsing Photo Collections through RDF geo-metadata
In recent years the growth in popularity of digital photography, together with the development of services and technologies to annotate and organize data on the Web, have extended the possibilities for managing and sharing large numbers of pictures. Our work explores the kinds of metadata that can be captured at the time a photo is taken, and ways to share these metadata in order to create a br...
متن کاملResource Description Framework (RDF)
The Resource Description Framework (RDF) is the standard knowledge representation language for the Semantic Web, an evolution of the World Wide Web that aims to provide a well-founded infrastructure for publishing, sharing and querying structured data. This article provides an introduction to RDF and its related vocabulary definition language RDF Schema, and explains its relationship with the O...
متن کاملPredictive Modeling using Features derived from Paths in Relational Graphs
This paper is concerned with supervised learning in relational domains. As relational framework, we use the Resource Description Framework (RDF) that is the basis for representing information about resources in the World Wide Web. The fundamental RDF structure is a relational graph such that feature derivation can be formulated in a simple graphical context. We present learning solutions for li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015